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Abstract 


Feature selection is the process of picking the most effective feature among a considerable number of features in the 
dataset. However, choosing the best subset that gives a higher performance in classification is challenging. This study 
constructed and validated multiple metaheuristic algorithms to optimize Machine Learning (ML) models in diagnosing 
Alzheimer’s. This study aims to classify Cognitively Normal (CN), Mild Cognitive Impairment (MCI), and Alzheimer’s 
by selecting the best features. The features include Freesurfer features extracted from Magnetic Resonance Imaging 
(MRI) images and clinical data. We have used well-known ML algorithms for classifying, and after that, we used multiple 
metaheuristic methods for feature selection and optimizing the objective function of the classification. We considered 
the objective function a macro-average F1 score because of the imbalanced data. Our procedure not only reduces the 
irreverent features but also increases the classification performance. Results showed that metaheuristic algorithms could 
improve the performance of ML methods in diagnosing Alzheimer’s by 20%. We found that classification performance 
can be significantly enhanced by using appropriate metaheuristic algorithms. Metaheuristic algorithms can help find the 
best features for medical classification problems, especially Alzheimer’s. 


Keywords: Metaheuristic algorithm, Alzheimer’s disease, MRI, Machine learning, Feature selection, Data mining. 


1 | Introduction 


QO) Nowadays, a considerable amount of data is generated daily, and data-related technologies are 
T advancing rapidly to cope. Data Mining (DM) is one of the methods to conclude helpful information 
International Journal of . : : . . He 
Research in Industrial | from this massive amount of data with the help of Machine Learning (ML), pattern recognition, 
Engineering. This article | algorithms, and statistics. ML algorithms employed in solving different problems [1] specially disease 


is an open access article | diagnosis. Alzheimer’s Disease (AD) is one of the problems, with a large amount of data produced 
distributed under the 


for diagnosing and treating it. The exact causes of AD are not yet clear. However, there are some risk 
terms and conditions of 


factors with proven effects on the cognitive function of the brain such as White Matter (WM), Gray 
Matter (GM), and Cerebrospinal Fluid (CSF) CerebroSpinal Fluids (CSFs) [2]. These biomarkers can 
be found by examining different types of imaging modalities like Magnetic Resonance Imaging 
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Rating (CDR), Mini-Mental State Examinations (MMSE), and AD Assessment Scale Cognitive (ADAS- 
Cog) [3]. However, physicians still have difficulty predicting Alzheimer’s because these biomarker 
vatiations are difficult to generalize. This is where DM and ML models, that are trained on multiple 
factors, can help in diagnosing Alzheimer’s [4], [5]. Raw data from MRI scans need to be adjusted and 
pteprossedsed to be used by before-mentioned models. Preprossessing data also helps with models 
accuracy [6]. One commonly used methods in data preprocessing is Feature Selection (FS) [6], which 
tries to select the smallest and most valuable features from data, leading to better classification 
performance. As well as performance, FS can speed up the decision-making process. The FS process is 
considered an NP-hard searching problem [7]. When there are n features, it will be a 2n subset of features 
that can be used, and when n is large, the computation cost increases. Therefore, for high-dimensional 
data, exhaustively generating all possible subsets becomes impractical and computationally intensive. 
Forward sequential, backward sequential, random, and heuristic searches are all examples of search 
strategies that can be used to improve the efficiency of the FS method. Recently, metaheutistic 
algorithms have been successfully applied to many FS methods [8]. As there is still a lot of work to do 
to find the main features of AD in this paper, we proposed a novel method for selecting the best features 
among many extracted from MRI images in combination with other health factors. Our proposed 
approach focuses on the choice of feature selection algorithm to overcome computation cost problems 
employimg metaheuristic methods. Also, classified AD into three different groups. The article is planned 
as follows. The next section discusses the literature review of other authors who have used DM and ML 
algorithms to analyze Alzheimer’s. Section 3 describes the proposed technique used for feature 
extraction from MRI images and optimizing the result by metaheuristics. Section 4 discusses the 
experiments and evaluation results. Finally, Section 5 presents the paper summary and conclusions. 


Table 1. Demographic details of OASIS-3 subjects. 


Male Female 
Number of subjects 487 611 
Age 70.17 (42.5-91.7) 67.78 (43.2-95.6) 
Right-handed 433 546 


2 | Literature Review 


Healthcare costs today are much higher due to technological advances and demographic changes. 
Planning and managing treatment resources and facilities is critical to controlling and reducing these 
costs and providing the desired services to patients [9]. Multiple criteria are needed to be managed in 
health care like infection control surveillance, diagnosis and treatment of various diseases, healthcare 
resoutce management, customer relationship management, healthcare administration, hospital 
management and public health. 


Artificial Intelligence (AI) models played a successful role in health care issues, especially managing 
detection, determination, and disease prediction costs [10]. Hariri et al. [11] developed a model for 
diagnosis of hear failure using AI models to manage treatment costs. In all the AI models developed for 
disease classification, feature selection was the most challenging task. It involves selecting the most 
distinct and relevant subset of features from a large set of features to represent the dataset. In recent 
years we have seen studies on feature selection in medical imaging. For example, an ML model 
developed by Zhang et al. [5] combines AD baseline features from MRI images, hypometabolism, and 
CSF to classify AD vs CN. In the proposed model, they used a multiple-kernel support vector machine. 
Non-imaging biomarkers with CSF and clinical data were also used to develop and compare ML models 
to classify CN versus Mild Cognitive Impairment (MCI) [12]. More recently, Stamate et al. [13] compared 
three ML models to classify CN versus AD, which XGBoost claimed to be the best one that used blood 
metabolite data with clinical and cognitive data. Demir et al. [14] implemented a convolutional neural 
network to extract features from MRI images. Multiple methods have been proposed to solve the feature 
selection problem, such as random and greedy search, exhaustive search, etc. Most of these methods 
suffer significant complications and high computational costs. 
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How metaheuristic algorithms can help in feature selection for Alzheimer’s diagnosis 


Metaheuristics provide practical solutions in a considerable amount of time, optimizing classification 
performance and overcoming the curse of dimensionality by alleviating the consumption of large functions 
such as computational resources, storage, etc. Bahmani et al. [15] employed improved Whale Optimization 
Algorithm (WOA) and for flow shop scheduling and vehicle routing and showed efficiency of improved 
WOA in converging to optimal solution and achieve better solution in comparison to the Genetic 
Algorithm (GA). Dirik [16] applied fuzzy GA to detection of counterfeit banknotes. Zhang et al. [17] fit 
GA to solve the appointment scheduling problem. 


Therefore, metaheutistic algorithms as a solution received extra attention in classification. For example, 
Negahbani et al. [18] examined coronary artery disease and proposed an evolution based search algorithm 
with fuzzy c-means classifier. Al-Tashi et al. [19] used Grey Wolf Optimizer (GWO) to select the best 
features and used SVM as a classifier for diagnosing cardiovascular disease. Mafarja and Mirjalili [20] 
applied Simulated Annealing (SA) algorithm as a local search method to enhance the WOA to choose the 
optimal subset of features to develop classification accuracy on datasets from the UCI data repository [21]. 


Overall, studies are conducted extensively on various modalities for AD diagnosis, but still, researchers are 
facing difficulties with finding the best features. Therefore, an active feature selection algorithm is essential 
in identifying key features. Also, most of the research concentrated on 2 class classifications and did not 
consider the MCI phase, which is a more critical phase in AD diagnosis. 


3 | Research Methodology 


In this section, we will discuss the preprocessing methods and steps we did for the classification and 
optimization of the classification method by feature selection. In the first step, we compared well-known 
algorithms of ML; Adaboost, GradientBoost, XGBoost, Light GBM, and Gaussian Naive Bayes for our 
data classification with 192 features extracted from brain images. In the next step, we optimize the ML 
algorithm using metaheuristic algorithms for feature selection. 


The data we have used is from the OASIS-3 dataset, a newly introduced dataset for AD. It was prepared 
across studies in Washington University Knight Alzheimer disease research center over 15 years. 
Demographic information about it is illustrated in Tabé 7. Participants in this study ranging from 42 to 95 
years old including Cognitively Normal (CN) and different stages of cognitive decline adults. OASIS-3 
dataset contains over 2000 MR sessions, including structural and functional sequences. Imaging data is 
accompanied by dementia, and APOE status, which is the strongest genetic risk factor for AD diagnosis 
suggested in multiple papers [22]—[24], and longitudinal clinical and cognitive outcomes. 


Table 2. Features description. 


Variable Description 

SubCortGrayVol Subcortical GM volume 

CortexVol Total cortical GM volume 
IntraCranialVol ICV, intracranial volume 
CorticalWhiteMatterVol Total cortical WM volume 
TotalGrayVol Total GM volume 

SupraTentorialVol Supratentorial volume 

IhCortexVol Left hemisphere cortical GM volume 
rhCortexVol Right hemisphere cortical GM volume 
IhCortical WM Vol Left hemisphere cortical WM volume 
rhCortical WM Vol Right hemisphere cortical WM volume 
APOE Apolipoprotein E 


MMSE Mini-mental state examination 


Table 3. CDR details of OASIS-3 subjects. 
Categories Max CDR=0 Max CDR=0.5 Max CDR=1 Max CDR>2 


Min CDR=0 605 192 39 14 
Min CDR=0.5 - 66 61 50 
Min CDR>1 - - 31 36 
Total 605 258 131 100 


3.1 | Preprocessing 


In this study, we used features extracted from T1w MRI images. These images were processed through 
Freesurfer. PreeSurfer is an imaging tool and the most widely used software for analysing brain images. 
It is used for the depiction of the cortical surface between white and GM, segmentation of WM from 
the rest of the brain, skull stripping, and many other purposes. We have used the statistical output from 
the subcortical segmentation. We also considered APOE and MMSE values because they are important 
risk factors for AD diagnosis. Some of the features we have used are described in Table 3. Overall, there 
were 2047 records, and from that, we eliminated records with missing APOE or MMSE. 


We have created three classes based on the provided CDR scores in which a CDR value equaling zero 
represents the non-existence of Alzheimer’s, a CDR equaling 0.5 represents MCI subjects, and a CDR 
greater than 0.5 represents the presence of AD. Detailed information about CDR values is shown in 
Table 2. Some patients have multiple records because they had multiple visits; we didn’t want to have 
any overlap between train and test dataset. To overcome this problem for each subject, we kept the 
records with the biggest CDR value indicating Alzheimer’s or MCI. After these steps, the number of 
data became 1006 records and included 659 CN, 265 MCI, and 82 severe Alzheimer’s (AD). It can be 
seen that data distribution is imbalanced. We split the data as 70% for training and 30% for testing. 


The impact of class imbalance on classification performance is a major issue [25], [26]. In these studies, 
authors discussed the impact of im-balanced classification on accuracy using various examples and 
showed reporting classification accuracy for a severely imbalanced classification problem could be 
dangerously misleading. Ac-curacy is not considered a good evaluation measure as the majority class 
overwhelms the errors of the minority class. It is appropriate to use macro-averaging metrics over micro- 
averaging to avoid a dominance of majority classes [27]. In this work, we used macro average F1 score 
as the performance metric and the objective function for optimizing. F1 score is the variant most often 
used when learning from imbalanced data, which weights precision and recall equally [28]. After 
preprocessing the data for discrimination of CN, MCI and AD we used Adaboost [29], Gradient Boost 
[30], XGBoost [31], Light GBM, and Gaussian Naive Bayes [32] classifiers with all the extracted features. 


Table 4. Metaheuristic algorithms performance. 


Classifier Default PSO GWO DFO GA 
F1- NSF Fl- NSF Fl- NSF Fl- NSF Fl- NSF 
MAVG MAVG MAVG MAVG MAVG 
AdaBoost 62.55% 198 78.04% 90 76.56% 181 77.00% 94 76.77% 198 
Gradient 78.09% 198 83.81% 96 82.40% 192 82.90% 112 83.13% 198 
boosting 
XGBoost 75.65% 198 85.13% 105 84.32% 189 83.60% 91 84.08% 198 
LightGBM 75.14% 198 82.97% 89 82.80% 183 82.00% 102 82.37% 198 
Gaussian naive 52.92% 198 72.30% 87 67.72% 148 69.20% 87 66.29% 198 
bayes 


*Particle Swarm Optimization (PSO), Grey Wolf Optimizer (GWO), Dragonfly Optimization (DFO), Genetic 
Algorithm (GA), F1 score Macro Average (F1 MAVG), Number of Selected Features (NSF). 


IJRIE 


200 


Salami et al. | Int. J. Res. Ind. Eng. 12(2) (2023) 197-204 


IJRIE 


201 


How metaheuristic algorithms can help in feature selection for Alzheimer’s diagnosis 
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Fig. 1. Different meta-heuristic algorithms' performance on feature selection and different classifiers. 


3.2 | Feature Selection 


Feature selection has been considered to be an NP-hard problem [33] as it is a challenging problem to find 
the best subset of features. Therefore, more like other NP-hard problems, a high-performance 
metaheuristic method is required to reduce processing time. The proposed method in this study aims to 
compare and propose the best metaheuristic method to find the optimal combination of features. We used 
Particle Swarm Optimization (PSO), GWO, Dragonfly Optimization (DFO) Algorithm and GA as our 
metaheuristics algorithms because of their exploration and exploitation ability. PSO is a computational 
method that optimizes an objective function by iteratively improving candidate solutions with respect to a 
given quality measure. It takes a population of candidate solutions and solves the problem by moving these 
particles in the search space according to a simple formula via their positions and velocities. Each particle’s 
motion is influenced by its best-known position locally. It also leads to the best-known position in the 
seatch space that is updated as better positions found by other particles. GWO mimics the natural gray 
wolf leadership hierarchy and hunting mechanisms. GWO implements and optimizes multiple steps: hunt, 
loot search, loot siege and loot attack. The DFO algorithm is derived from static and dynamic swarm 
behavior. These two herd behaviors ate very similar to exploration and exploitation in the metaheuristic 
phase. Dragonflies form subflocks and fly in static flocks over different regions, which is the main purpose 
of the exploration phase. In static flocks, on the other hand, dragonflies fly in one direction in larger flocks. 
This is an advantage during the recovery phase. GA was inspired by the process of natural selection, which 
belongs to the larger class of Evolutionary Algorithms (EAs). GAs, which rely on biologically-inspired 
operators such as mutation, crossover, and selection, are widely used to generate high-quality solutions to 
optimization and search problems. We considered the objective function as macro average F1 score. For 


each algorithm, we have trained for 20 iterations to reach a maximum macro average F1 score in 
Alzheimer’s classification. 


OASIS-3 has been used in experiments that demonstrated the proposed approach produces a statistically 
significant compact set of features and improves the F1 score in classification. Results showed that PSO 
outperformed other methods in selecting several features and leads to improving classification metrics. 


4 | Results and Discussion 


In this research, multiple experiments are set up to analyze the performance of metaheuristic feature 
selection methods among different classifiers for diagnosing Alzheimer’s. Five benchmark classifiers 
AdaBoost, Gradient Boosting, XGBoost, LightGBM, and Gaussian Naive bayes were applied in these 
experiments to evaluate the performance of feature selection methods. This section discusses the 
experimental results of different classifiers with PSO, GWO, DFO, and GA feature selection methods 
on the OASIS-3 dataset. 


Table 4 outlines the Macro average F1 score obtained over different algorithms. The training process is 
also shown in Fig. 1. We performed multiple experiments comparing the effect of metaheuristic methods 
on optimizing the classifiers. Results show metaheuristics increase the macro average F1 score between 
4.31% to 19.38%, We can remark that based on the selected features by different optimizers, the 
performance of the PSO with the XGBoost classifier outperforms other optimizers and classifiers 
because it reaches 85.13% macro average F1 score which is about 9.48% better than using XGBoost 
alone. This proves PSO optimized XGBoost classifier’s future performance on the unseen data, and 
hence it can be used as a candidate method for Alzheimer’s diagnosis. 


5 | Conclusion 


This paper tackles the optimization of classification methods for AD diagnosing problems. The ML 
classification methods ate modified, and feature selection is applied in hybrid mode. Four metaheuristic 
methods are proposed besides ML methods and compared against the previous ones. To assess our 
proposed method, OASIS3 dataset is used. This dataset contains MRI images for people aged 42 to 95 
years old. The dataset is highly imbalanced and finding a proper solution for AD classification in this 
dataset is difficult. 


Evaluation is performed using appropriate measures (Macro-average F1 score) for imbalanced data and 
results compared. We found that the performance of classifiers can be improved significantly by using 
appropriate features, and metaheuristic algorithms can help to find the best features for medical 
classification problems. The proposed method can be improved by using other metaheuristics. 
Purthermore, other ML or deep learning algorithms can be applied to investigate the solutions. Finally, 
another way of future research could be to study the effect of hyperparameters values on performance. 
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